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Abstract 

Convergence rates results for Tikhonov regularization of nonlinear ill-posed op- 
erator equations in abstract function spaces require the handling of both smoothness 
conditions imposed on the solution and structural conditions expressing the charac- 
ter of nonlinearity. Recently, the distinguished role of variational inequalities holding 
on some level sets was outlined for obtaining convergence rates results. When lower 
rates are expected such inequalities combine the smoothness properties of solution 
and forward operator in a sophisticated manner. In this paper, using a Banach space 
setting we are going to extend the variational inequality approach from Holder rates 
to more general rates including the case of logarithmic convergence rates. 
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1 Introduction 



With the monograph [8] Charles Groetsch presented an extremely well-readable in- 
troduction to the theory of Tikhonov regularization of ill-posed operator equations in 
Hilbert spaces. For linear ill-posed problems in that book the ingredients and conditions 
for obtaining convergence rates, the role of source conditions and the phenomenon of sat- 
uration are outlined. The ill-posedness of a linear operator equation describing an inverse 
problem with 'smoothing' forward operator in Hilbert spaces corresponds with the fact 
that the Moore-Penrose inverse of the forward operator is unbounded and only densely 
defined on the image space. In that sense, solving linear ill-posed problems based on 
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noisy data can be considered as the application of that unbounded operator to such data 
elements. For further theoretic extensions we refer to the recent monograph [9]. In 1989 
Engl, Kunisch, and Neubauer published a seminal paper [5j on convergence rates re- 
sults for the Tikhonov regularization of nonlinear ill-posed problems in the Hilbert space 
setting (see also jH, Chapter 10]). After the turn of the millennium motivated by specific 
applications, for example in imaging, there occurred numerous publications on the Ba- 
nach space treatment of linear and nonlinear operator equations including convergence 
rates results (see, e.g., P d EDI E31 EH EZ|). Initiated by the paper [2j of Burger and 
Osher Bregman distances were systematically exploited for evaluating the regularization 
error. Because of a completely different methodology for obtaining convergence rates in 
(generalized) Tikhonov regularization we have to distinguish low rate results up to Breg- 
man errors of order 0(5) for the noise level 5 > and enhanced rate results up to the 
Bregman saturation order 0(<5 4 / 3 ). Recently, in the papers [TBI HE] j moreover in (6J [12], in 
the thesis [22] and in the monograph [25] by Scherzer ET AL. the distinguished role of 
variational inequalities for proving low rate convergence rates of Holder type was worked 
out. This paper tries to extend the variational inequality approach to obtain more general 
Bregman rates of form 0(<p(5)) with concave index functions ip. This includes the case of 
logarithmic convergence rates (see the papers [321 [18] by Hohage and Kaltenbacher). 

The paper is organized as follows: In Section[2]we present a general setting of Tikhonov 
type variational regularization with convex stabilizing penalty functional and strictly con- 
vex index function of the residual norm that characterize with positive some regularization 
parameter the sum functional to be minimized for obtaining stable approximate solutions 
of the nonlinear ill-posed problem under consideration in a Banach space setting. The 
standing assumptions of the setting and assertions on weak convergence and level sets 
are also outlined in Section El The subsequent Section [3] discusses structural conditions 
on the nonlinearity of the problem and source conditions as well as approximate source 
conditions imposed on the solution. The first main result yielding an extension of the 
variational inequality approach from convergence rates results of Holder type to results 
for general convex index functions is formulated and proven as Theorem 14.31 in Section [H 
As an essential ingredient the proof applies a generalization of Young's inequality. The 
second main result will be given in the concluding Section [5] by the couple of Theorems l5.ll 
and 15.21 that provide us with sufficient conditions for obtaining the more general varia- 
tional inequalities required in Theorem 14.31 The canonical source condition for low rates 
in Banach spaces and distance functions for measuring its violation form the basis for 
that conditions. 



2 Problem setting and assumptions 

In this paper, ill-posed operator equations 

F{x)=y (2.1) 

are under consideration, where the operators F : T>(F) C X — > Y with domain T>{F) are 
mapping between real Banach spaces X and Y, respectively. For some noise level 5 > 
let y s denote noisy data of the exact right-hand side y = y° E F(T>(F)) with 

\\y 5 -y\W<6. (2.2) 
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with a misfit function if) : [0, oo) — > [0, oo) and a penalty functional Q : P(O) C X — > 
[0,oo). The set of admissible elements for the minimization of (12.31) is the intersection 
V := P(.F) fl V(Vt) of the occurring domains. 

Index functions play a central role in our considerations. Originally coming from the 
theory of variable Hilbert scales and expressing the function- valued index of such a scale 
element (see [10, 14]) we use this concept as follows: 

Definition 2.1 We call a real function r] : [0, 00) — > [0, 00) (and also its restriction to 
any segment [0, a] (0 < a < 00) ) index function if it is continuous and strictly increasing 
with 77(0) = 0. 

Note that for index functions 77, 771, 772 also the inverse function r/ -1 and the antideriva- 



tive 6(s) := f r](t)dt are index functions, furthermore also all positive linear combinations 



XiT]i + A 2 ?72 (Ai, A 2 > 0, \f + \% > 0) and compositions 771 o r] 2 . 
Throughout this paper we make the following assumptions: 

Assumption 2.2 

1. X and Y are Banach spaces with topological duals X* and Y* , respectively, where 
II ' \\x, || ' \\y and (-,-)x*,x and (-,-)y*,y denote the associated norms and dual 
pairings. In X and Y we consider in addition to the strong convergence — > based 
on norms the weak convergence — ^ based on the weak topology. 

2. F : T>(F) C X — > Y is weakly-weakly sequentially continuous and T>(F) is weakly 
sequentially closed, i.e., 

Xk — x in X with x^ € T>(F) x G V(F) and F(xk) — F(x) in Y. 

3. The set T>(Vt) is convex and the functional Q is convex and weakly sequentially lower 
semi- continuous. 

4- The domain V := T>(F) fl P(O) is non-empty. 

5. For every a > 0, c > 0, and for the exact right-hand side y = y° of (GO]), the sets 



are weakly sequentially pre-compact in the following sense: every sequence {xk}f° = i 
in A4 a (c) has a subsequence, which is weakly convergent in X to some element from 



s 







M a {c) :={xEV: r°(x) < c 



} 



(2.4) 



X. 
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6. ip : [0, oo) — > [0, oo) is an index function with the property that there exist a = 
a(ip) > 0, b = b(ip) > fulfilling 

ip(u + v ) < aipiu) + bip(v) Vtt, v G [0, oo). (2.5) 

One should notice that item 6 in Assumption [221 is fulfilled in case ip : [0, oo) — > [0, oo) 
is a p-homogeneous (with p > 0) and convex index function. We recall the ip is said to 
be p-homogeneous (with p > 0) whenever for all x G [0, +oo) and all t > it holds 

ij,(tx) = t p ip{x). 

Under the stated assumptions existence and stability of regularized solutions x s a can 
be shown in the lines of the proof of [25j Theores 3.22 and 3.23] (see also [13], Section 3]). 

For the convex functional Q with subdifferential dfl regularization errors in a Banach 
space setting are frequently measured by means of Bregman distances 

D^x, x) := tt(x) - ft(ar) -(£,x- x) x ,^ x , x G V{tt) C X , 

at x G V(Q) C X and £ G <9fi(x) The set 

X> B (fi) := {x G : dtt(x) ^ 0} 

is called Bregman domain. An element x^ G T> is called an Q-minimizing solution to (|2.ip 
if 

f2(x^) = min{f2(x) : F(x) = y, x G P} < oo . 

Such ^-minimizing solutions exist under Assumption 12.21 if (12. IK has a solution a:' in P. 
This can be shown in analogy to the proof of [25j Lemma 3.2]. 

We close this section by proving that the regularized solutions associated with data 
possessing a sufficiently small noise level 6 belong to a level set like the one in (12.4ft , pro- 
vided that the regularization parameters a = a(5) are chosen such that weak convergence 
to ^-minimizing solutions x' is enforced. 



Proposition 2.3 Consider an a priori choice a = a(8) > 0, < 5 < oo, for the 

regularization parameter in (12.31) depending on the noise level 5 such that 

a{5) -> and -> 0. (2.6) 
a{o) 

Provided that ( 12.41) has a solution x' in V then under Assumption \2.2i every sequence 
{ x n}^=i := { x a(Sn)Y n }^=i of regularized solutions corresponding to a sequence {y 5n }'^L l 
of data with lim^oo^ = has a subsequence {x nk }^ 1) which is weakly convergent in 
X, i.e. x nk — ^ x' and its limit x* is an Q-minimizing solution of (12 .4f) with Q(x') = 
lim fc ^ 00 fi(x n J. 

For given a max > 0, let x* denote an Q-minimizing solution of (12. 4p . If we set 

P = amax(l + ^(x t )), (2.7) 

then we have x> G -M Qmax (p) and there exists some <5 max > such that 

x S a{S) e M an ^(p) for all0<5< 5 max . (2.8) 
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Proof: The first part of the proposition can be proved in the same manner as [251 
Theorem 3.26]. Here the properties of the index function ib play a determinant role. 

We come now to the second part of the above statement and consider an a max > 0. 
Because of (12.61) there exists some 5 max > such that a (5) < a max and ^© < min{|, ^} 
for all < 5 < 5 max . In the following we write for simplicity a instead of a(S). 

For all < 5 < 5 max , by ((231), it holds 

TLj^a) = ^(WHxl) ~ V\W) + CW^(4) < 0^(11^(4) - V S Wy) + WW + <Wfi(*£) 

= a[^(\\F(x 5 a ) - y 5 \\ Y ) + an(x 5 a )\ + b^{5) + (a max - aa)Q(x s a ) 
< aT s a (x ] ) + bip(S) + (a max - aa)Q(x 6 a ) < (a + b)ip(5) + aaVt(x ] ) + (a max - aa)Q(x s a ). 
On the other hand, from T^(x 5 a ) < T^(x^) it yields O(i^) < ^p- + {l(x^). Consequently, 

T° (x 5 a ) <(a + b)ip(S) + aan(x^) + - a) ^{S) + (a max - aa) Sltf) 

max \ a / 

= hp(5) + -=^(5) + a max n(x^ < a max (l + Q(x*)) = p. 
a 

□ 



3 Source conditions and structural conditions of non- 
linearity for the Banach space setting 

To obtain convergence rates for Tikhonov regularized solutions in the case nonlinear ill- 
posed problems an appropriate interplay of solutions smoothness, if possible expressed by 
source conditions for x\ and of the structure of nonlinearity of F in a neighborhood of 
x^ is required. In this context, we are going to restrict the situation a little bit more as 
follows: 

Assumption 3.1 

1. F,fl,V,X andY satisfy the Assumption \2.2\ 

2. Let x' be an Q-minimizing solution of [2.1\) . 

3. The operator F is Gateaux differentiable in x' with the Gateaux derivative 
F'(x') G C(X,Y) (C(X,Y) denotes the space of bounded linear operators from X 
toY). 

4- The functional Q is Gateaux differentiable in x' with the Gateaux derivative 
£ = Q!(x^) G X* , i.e., x 1 " G V B (Q) and the subdifferential dil(x^) = {£} is a 
singleton. 



5 



In the case of Hilbert spaces X and Y by spectral theory one can consider bounded 
linear operators r](F'(x^)*F'(x^)) G C(X,X) for any index function n based on the fact 
that with the Hilbert space adjoint F'(x^)* G C(Y,X) of F'(x r ) G C{X,Y) the operators 
F'(x^)*F'(x^) G C(X,X) are non-negative and self-adjoint and this property carries over 
to the operators T](F r (x^)*F'(x^)). For Banach spaces X and Y, however, only the Banach 
space adjoint G C(Y*,X*) of F'(^) is available, but F r (x^)*F'(x^) and hence 

r/(F'(a;^)*F'(a;^)) are not well-defined. In contrast to the Hilbert space setting, where 
generalized source conditions 

£ = r](F'(x^)*F'(x^))v, veX, (3.1) 

can be exploited for arbitrary index functions 77, in our Banach space only the source 
condition 

£ = F'(x^)* w, weY*, (3.2) 

expressing a medium smoothness of £ has canonical character. We will consider this as an 
upper benchmark source condition here accepting that only low and medium convergence 
rates for the regularized solutions are under consideration. For expressing higher solution 
smoothness with respect to the stabilizing functional Q duality mappings can be helpful 
admitting enhanced convergence rates. For that we refer for example to the papers [TT1 
l20l |2T| . but we note that the higher source conditions used there seem to be a little 
bit artificial. Searching for low rate results in Banach spaces X and Y with solution 
smoothness limited by (13.21) our main drawback is the non-existence of generalized source 
conditions (13.11) with concave index functions 77 such that \ft = 0(rj(t)) as t — > 0. This 
class of index functions includes for < v < 1/2 the monomials 

rj(t) = t u (t> 0) (3.3) 

and for all \i > the family of logarithmic functions 

Jo (t = 0) 

VW ~ { [log(l/t)]-^ (0 < t < e-^ 1 ) ' ( ' ' 

Since Schock's paper [26] we know that convergence rates of regularized solutions can be 
arbitrarily slow. This corresponds with arbitrarily weak solution smoothness. For example 
the very low multiple logarithmic rates for associated generalized source conditions with 
index function r](t) = log log ... log(l/t) really occur in applications of the Hilbert space 
theory. 

One way of compensating the Banach space drawback of missing generalized source 
conditions consists in applying the method of approximate source conditions (see [3l [T2]) 
whenever £ fails to satisfy the benchmark source condition (I3.2p for any w G Y, but the 
obviously non-negative and non-increasing distance function 

d{R) :=mf{\\£- F'{x j )*w\\x* : w G Y* , \\w\\ Y * < R} , (3.5) 

which is well-defined for all R > 0, fulfills the limit condition 

lim d(R) = . (3.6) 
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We notice that on the one hand d : [0, oo) — > [0, oo) is continuous, being convex, and on the 

other hand (13.60 is fulfilled if and only if £ £ lZ(F'(x^)*) " x . By a separation theorem 
one can prove that the latter is guaranteed provided F'(x^)** is injective (respectively, 
F'(x>) is injective, if X, Y are reflexive Banach spaces). For A £ C(X,Y) we denote by 
A** £ £(X**,F**), defined by (A**x*\ y*) Y **,Y* = (x**, A*y*) x » x * for x** £ X** and 
y* £ y*, its bi-adjoint operator. 

The following lemma will be used in order to guarantee that the distance function 
defined in (13.51) strictly decreasing. 

Lemma 3.2 Let X, Y be reflexive Banach spaces and A £ C(X, Y) an injective operator. 
For £ £ X* we assume that £ ^ 1Z(A*). Then the distance function d : [0, +oo) — ► 
(0, +oo), defined by 

d(R) = inf{||£ - A*w\\ x * : w £ F*, ||w|| y . < i?}, 
zs strictly decreasing. 

Proof: First let us notice that for all R > there exists w £ Y*, ||«j||y* < R, such 
that d(R) = ||£ — v4*w||x*. This is because of the fact that the dual norm function 
is weak* lower semicontinuous and the unit ball in Y* is weak* compact (Theorem of 
Alaoglu-Bourbaki) . 

Let be R > 0. Next we prove that if for w £ Y*, \\w\\ Y * < R, it holds d(R) = 
||£ — A*w||x*, then one necessarily must have ||w||y« = R. In case R > 0, this fact is 
obvious. Suppose now that R > 0. Indeed, in this case w is an optimal solution of the 
convex optimization problems 

inf U-A*w\\ x *. 

\\w\\ Y *-R<0 

As the Slater constraint qualification is fulfilled (for w' = we have ||w'||y* — R < 0), 
there exists a Lagrange multiplier A > such that (see, for instance, [29, Theorem 2.9.2]) 

A(||w||y. -R) = 

and 

0ed(U-A*(.)\\ x * + \(\\.\\ Y *-R))(w). 

If we prove that A > 0, then the assertion follows. We assume the contrary. This means 
that 

0ed(U-A*(.)\\ x *)(w). 

Next we evaluate the above subdifferential. Let be L : X* — ► M, L(w) = ||£ +iu||x*> Since 
L is continuous, by [29], Theorem 2.8.2] we have that 

d(U - A*(-)\\ x *)(w) = d(L o (-A*))(w) = -A(dL(-A*w)). 

As A is injective, 

0edL(-A*w)=d\\-\\ x *(Z-A*w). (3.7) 
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For the subdifferential of the norm we have the following expressions 

d\\ ■ |M«) = {uEX: \\u\\ x < 1}, if v = 0, 

while 

d \\ • \\x*( v ) = {uE X : \\u\\ x = 1, (v,u) x *,x = \\v\\x*h ^ v ^ 0. 

By (13. 7ft it follows that only the first situation is possible. Consequently, £ — A*w = 0. 
But this is a contradiction to £ ^ 7?. (A*). Thus A > and, so, ||u>||y* = -R. 

Let us prove now that d is strictly decreasing. To this aim take R±, R 2 G [0, +00) such 
that < Rx < R 2 . It holds d(R 1 ) > d(R 2 ). Assume that d(R\) = d(R 2 ). Then there ex- 
ists u>i, w 2 G Y*, ||wi||y* = Ri, ||w2||y* = R2, such that d(R\) = d(R 2 ) = \\^ — A*wi\\x* = 
— A*w 2 \\x* ■ As ||wi||y* < R 2 , this leads to a contradiction to the above considerations. 
Consequently, d{R\) > d(R 2 ) and this concludes the proof. □ 

Let us mention that the decay rate of d(R) — ► as R — > 00, as assumed in (|3.6I) . 
expresses for the element £ the degree of violation of (|3.2I) and thus it can be handled as a 
replacement information for the missing index function 77 from (|3.ip in the Banach space 
setting. 

As an adaption of the local degree of nonlinearity introduced for a Hilbert space setting 
in pj2, Definition 1] to the Banach space situation with Bregman distance we suggested 
in [T6l Definition 3.2] a definition, which attains here under Assumption 13.11 the form: 

Definition 3.3 Let < cx,c 2 < 1 and < c\ + c 2 < 1. We define F to be nonlinear of 
degree (ci,c 2 ) at for the Bregman distance D^(-,x^) of Q with £ = Q!{x^) if there is a 
constant K > such that 

\\F(x) — F(x*) — F'(x^)(x — x*)\\ Y < K \\F{x) Dfax*)'* (3.8) 

for all x G M amax {p). 

In [12] it was shown that the method of approximate source conditions yields con- 
vergence rates of Tikhonov regularized solutions x s a minimizing ( 12.30 with misfit function 
ip{t) = t p (p > 1) whenever we have c\ > in the degree of nonlinearity and if £ fails 
to satisfy the benchmark source condition (I3.2p . The corresponding rates depend on the 
distance function (I3.5p . If C\ > and the source condition (13.21) holds, then we even 
obtain Holder convergence rates with Holder exponents k = (see [K]). If the non- 
linearity of F at x^ is such that c\ > cannot be satisfied, then rate results are only 
known for c 2 = 1 and ( I3.2p under the additional smallness condition < 1 (see 

[25]). As already mentioned in [18J for the Hilbert space setting, there seem to be no rate 
assertions for c\ = 0, c 2 = 1 if £ fails to satisfy the benchmark source condition (13 . 21) . i.e., 
low rate results including results on logarithmic convergence rates are missing for c\ = 
in case of absence of the source condition (13.20 . However, if we cannot find a c\ > it is 
an interesting open question whether low rate results can be derived if the structure of 
nonlinearity only satisfies the weaker condition 

\\F'(x^)(x - x^)\\ < C a(\\F(x) - F(x^)\\ ) (3.9) 
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for all x G M. amax {p) with some constant C > and some index function a. We will 
attack this task in the next section using variational inequalities as tool. In this context, 
let us note that the validity of (13.91) for a(t) = t Cl and < c x < 1 implies with the triangle 
inequality that we have 

\\F(x) - F(x*) - F'(x ] ){x - x ] )\\ y < \\F'(x ] ){x - x+)|| y + \\F{x) - F(x ] )\\ Y 

< C \\F{x) - F(x ] )\\y + \\F(x) - F{x^)\\ Y < K \\F{x) - F(x^)\\ Y 

on the associated level sets which shows a degree (ci,0) of nonlinearity. In general we 
conjecture that only concave index functions a are of interest in the condition (13.91) . 



4 Variational inequalities and convergence rates 

In recent publications (see [121 El EH EH]) variational inequalities of the form 

{i,x^ -x) x ^ x <(5 x D^x^) + p 2 \\F(x) - F{xt)\\ Y forall x e M amax (p) (4.1) 

with two multipliers < (3i < 1, /3 2 > and an exponent k > have been exploited for 
obtaining convergence rates in Tikhonov regularization in Banach spaces, where in the 
functional (12.31) to be minimized the strictly convex misfit function ip(t) = t p (p > 1) was 
used. We repeat in our context the Proposition 3.3 from [16]: 

Proposition 4.1 Set ip(t) := t' p (p > 1) in (2.3) and assume that F, f2, V, X, Y, x^ and 

£ satisfy the Assumption \3.1i If there exist constants < (3\ < 1, fa > 0, and < 
K < 1 such that the variational inequality H4-l\ ) holds with p from \2. 7)j . then we have the 
convergence rate 

D^i(S)^ r ) = £>(n as 5^0 (4.2) 
for an a priori parameter choice a (5) x 5 P ~ K . 

The proof of this proposition is based on the inequality T^(x s a ) < T^x^) that holds for 
all regularized solutions x s a and on the variant 

If 2 11 
ab<a p '+ a,6>0, Pl ,p 2 > 1 with — + — = 1) (4.3) 

Pi P2/Pl p 2 Pi Pi 

of Young's inequality. Note that due to Proposition 4.3 in [16] the case k > 1 is not of 
interest, since ( 14.11 ) with k > 1 implies the singular case £ = 0. 

For obtaining more general low order convergence rates we change (14. 1 1) as follows: 
We assume that there holds a variational inequality 

(£,xt-x) x , x <p 1 Dt{x,xi) + p 2V (\\F(x) - F(x^)\\ Y ) forall x e M amax (p) 

(4.4) 

with two multipliers < Pi < 1, fa > and an index function ip. 
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Assumption 4.2 Regarding the functions ip from \2. 3 I J and ip from H4-4\) we make the 
following assumptions: 

1. if) and ip are index functions which are twice differentiate on the interior of their 
domains. 

2. ip is strictly convex and ip is concave. 

Under Assumption 14.21 we can define another index function / as follows: 



/(0) = and f(s) 



[s) when s > 0. (4.5) 



Let us first show that / is well-defined, by proving that (f'(s) > when s > 0. Indeed, 
suppose that there exists s > in the interior of the domain of ip such that <p'(s) = 0. 
Take t > s. By the concavity assumption one has 

= <p'(s)(t-s)><p(t)-<p(s), 

which contradicts the fact that ip is strictly increasing. 

By employing similar arguments, since if) is convex, whenever s > one has that 
if)'(s) > and so f(s) > 0. As the continuity of / is automatically satisfied, in order to 
prove that / is an index function, one only needs to show that / is strictly increasing. 

Take < si < S2- Then ip^ 1 (si) < ip^ 1 ^). As if) is strictly convex, if)' is strictly 
increasing and so < ^'(^~ 1 (s 1 )) < ^'(^^ 1 (s 2 ))- On the other hand, since ip is concave, 
ip' is non-increasing, consequently, (/) / (y9 _1 (s 1 )) > y9 / (y9~ 1 (s 2 )) > 0. From here one has 
f(s%) < f(s 2 ). Hence / is an index function and so is the antiderivative 

s 

H(s):= J f(r)dr. (4.6) 
o 

¥>(*) 

From ((US} it follows that for s > ifj{s) = J f(t)dt + C. As ^(0) = 0, this yields 

o 

C = and, consequently, 

tp(s) 

iP(s) = H{<p{s)) = [ f(t)dt. 





Now aspects of the interplay between ip, if, f and H can be written in different manner 
by the equations 

if> = H o ip, H — ip o ip^ 1 

and 

/W = [^rtW ( s >o), 

where the last equation yields (14.51) by differentiation and use of the chain rule. Further, 
let 

s 

GOO := J r\r)dr (4.7) 
o 
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be the antiderivative of the inverse function to / and one can verify the cross-connections 

H = Go f and G~ l o $ = f o <p . (4.8) 

Now we are ready to present the main convergence rate result of this paper: 

Theorem 4.3 Assume that F, Q, V, X, Y, x* , £ and ip satisfy Assumption ^ . 1\ and assume 
that ip and if satisfy Assumption \4.^ which ensures the existence of an index function f 
defined by (^.5[ ). Let there exist constants < (3\ < 1, (3 2 > 0, such that the variational 
inequality j4-4\) holds with p from \2. 71 j . Then we have the convergence rate of Tikhonov 
regularized solutions 

D&tiS),^) = 0{<p{8)) as 6^0 (4.9) 
for an a priori parameter choice 

a(S) = 4-/(^(0)- ( 4 -!0) 



Proof: For all a > regularized solutions x s a minimizing (12.3P have to satisfy the in- 
equalities T^(x 5 a ) < T*(x^). Using the definition of the Bregman distance this implies for 
the noise model ( 12.21 ) the estimate 

i>(\\F(x s a ) - y 5 \\ Y ) + aD^xl x*) < if>{8) + a (n(a;t) - n(s£) + D^x 5 a , x*)) . (4.11) 

Moreover, from the variational inequality ( 14.41) we obtain that 

fi(xt) - Q(x 5 a ) + xt) = - (£, a£ - xt> x%x 

< + /? 2 ^(||F(4)-F(xt)|| y ). 

Therefore from (14.111) it follows that 

^\\F(x s a ) -y 5 \\ Y ) + aDs(xlJ) < ^(5) + afcDfcl x+) + «^(||F(x 5 J - F(^)|| y ) 

(4- 

= ^(5) + afcD&i, x+) + -(aaftM||F(a£) - F(^)|| 

a 

Using the generalization of Young's inequality (see, for instance, [19]) for the index func- 
tion / we obtain for sufficiently small a > 

<p(\\F(xi)-F(xl)\\ Y ) aafc 

(aa[1 2 M\\F(xi) - F(J)\\ Y ) < J f(t)dt+ J f-\r)dr 



(4.13) 

= H{ V {\\F{xi)-F{x^\\ Y ) + G{aal3 2 ) = i>{\\F(x s a ) - F{J)\\ Y ) + G{aaf5 2 ). 
From (14T21) and (14T31 ) it follows that 
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^(\\F(xi)-y s \\ Y ) + aD^xi,x^ 

< ^{5)+a(3 l D i {xi,x^) + -iP(\\F(x 6 a ) - F(x^)\\ Y ) + -G(aaP 2 ) 

a a 

< i>(5) + afaDfci, xt) + i/>{\\F(x s a ) - y 5 \\ Y ) + -i/>(S) + - G(aa(3 2 ). 



Consequently, 




1 



(a + b)if>(6) + G(aa/3 2 ) 



(4.14) 



(1 - Pi)a 



a 



for sufficiently small a > 0. 

Next we prove that for a (8) := it holds 




(4.15) 



for sufficiently small 8 > 0. Indeed, (14.15R is equivalent to 

(a + b + l)ap2<p(8)a(5) - (a + b)ip(8) - G(a(5)a(3 2 ) > 



(4.16) 



for sufficiently small 5 > 0. Denote by K(6) := (a + b + l)a/3 2 V 9 (^) Q; (^) — (a + b)tp{5) — 
G(a(5)a(3 2 ). One has K(0) = 0. We prove that K'{8) > 0, for sufficiently small 5 > 0, 
and this will have as consequence the fact that K(S) > K(0) = 0, for sufficiently small 
5 > 0. Indeed, one has for sufficiently small 8 > 



for sufficiently small 8 > and some constant c > 0. □ 

We should note here that because of (14. 8K ct(<5) = ~^f{ ( p{^)) = denotes 
an equilibration up to a constant of the two terms in the numerator of the second fraction 
in (j4.14ft . This order equilibration corresponds with the standard approach for obtaining 
such convergence rates. 

Example 4.4 We conclude this section with the example situation of monomials (power 
functions) <p(t) = t K (0 < K < 1) and tp(t) = t p (p > 1) discussed in [TB] for which 
Proposition 14.11 was repeated above. Then our assumptions are satisfied and we have 



H(t) = t p/K , f{t) = - t ip ~ K)/K } G(t) ~ t p/ip - K \ a(8) ~ 8 p - K , D^x 5 a{s) ,x^) = 0(8 K ) . 



We would like to notice that one comes to the same conclusion also in the case < k < 
p < 1 discussed in [6]. The reason therefore lay in the fact that / remains an index 
function and, consequently, Theorem 14.31 is still applicable, even if in this situation ip fails 
to be strictly concave. In fact, in order to obtain the convergence rate (14.91) in Theorem 
!4.3l one needs only to guarantee that the function / defined as in (I4.5P is an index function 
which is differentiable on the interior of its domain. This happens when Assumption 14.21 
is satisfied, but can be the case also in other settings. 



K'{8) = (a + b + l)ip'(8)f(<p(8)) + (a + b + l)<p(8)f'(<p(8W(8) 

-(a + %w(v^)) - r\f{vmnvW)AS) 

= <p'(6)f(ip(6)) + (a + bM8)f'&(8W(8) > 0. 



Thus (14.151) holds and this yields the estimate 



Ds(x 5 a{s) ,xi)<c <p(8) 
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5 Variational inequalities based on canonical source con- 
ditions and approximate source conditions 



In this section we are going to formulate sufficient conditions for variational inequalities 
(14.41) when only some weak structural assumption of the form (I3.9P on the nonlinearity of 
F with concave index function a is imposed. 

Theorem 5.1 Assume that F,Q,V,X,Y,x^,C, and if> satisfy the Assumption \3.1i 
Let £ satisfy the canonical source condition $3.B) and the structural condition $3.91) with 
some index function a and some constant C > for all x G M. amax (p) . Then a variational 
inequality $4-4\ ) holds with two multipliers < fa < 1, fa > and with the index function 
if = a. 

Proof: Owing to (13 . 2H and (13.91) we can estimate for all x G M-a max (p) as 
(6, - x) x ^ x = (F'(x^)*w, x ] - x) x ,^ x = (w, F'(x ] )(x ] - x)) Y „ Y 

< \\w\\ Y *\\F' (x ] ){x - x ] )\\ y < C\\w\\ Y *o-(\\F(x) -F(x ] )\\ Y ). 

This, however, yields the variational inequality ( 14.41 ) with fa = < 1, fa = C \\w\\ Y * and 
with ip = a, where a is the index function from ( 13.90 . This proves the theorem. □ 



Theorem 5.2 Assume that X,Y are reflexive Banach spaces, F, £1, T>, X, Y, x', £ and if) 
satisfy the Assumption \3.1\ and F'(x') is an injective operator. Let £ ^ TZ(F'(x*)*). 
Moreover, assume that the structural condition $3.91) is fulfilled with some index function 
o and some constant C > for all x G M. amax (p) an d that the Bregman distance is locally 
q-coercive with 2 < q < oo, i.e. there is some constant c q > such that 

D$(x, x^) > c q \\x — x^\\ x (5.1) 

holds for all x G •Ma ma x(p)- Then a variational inequality $4-4\) holds for all 
x G M-otmaxip) with two multipliers < fa < 1, fa > and with the index function 
<p(0) = 0, <p(t) = [di^iait)))] 9 * (t > 0), where \ + ^ = 1 and V : (0, oo) -> (0,oo), 

Proof: Instead of (13.21) we have here for all R > the equations £ = F'(x^)*wr + r R 
with 1 1 i^ftlly* < R and ||r^|| x » = d(R). Then we can estimate for all x G M-a max {.P) D Y 
using (13.91) as 

(f , x ] - x) x ^ x = (F'(x^)*w R + r R , x ] - x) x ^ x < (w R , F'(x ] )(x ] - x)) Y ^ y +(r R , x f - x) x ^ x 

< R \\F'(x^)(x - x^)\\ Y + d(R) \\x - < RCa(\\F(x) - F(x j )\\ Y ) + d(R) \\x - x*\\ . 
Now for q and q* adjoint exponents with 1/q + 1/q* = 1 the inequality 

(£, x ] - x) x , x < RCa(\\F(x) - F{x r ) \\ y ) + c~ xlq d(R) D^x, x ] ) 1/q 
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obtained from (15.11) can be further handled by using Young's inequality in the standard 
form 

a pi If 2 11 

ab < 1 (a, b > 0, pi,p 2 > 1 with 1 = 1) 

Pi P2 Pi Pi 

when setting a := D^(x, x^), b := Cq 1 ^ d(R), pi := q, p 2 : = q* ■ In that way we derive for 
all R > 

1 c~ q * /q 
(£,x f -x) x , x < RC a{\\F{x) - F{x ] )\\ Y ) + -D^x,x f ) + d(R) q * . 

The continuity of d carries over to the auxiliary function ^ : (0, oo) — > (0, oo), ^(R) = 

d ^ q , which is continuous and strictly decreasing, and which fulfills \im R ^ fy(R) = oo 
and Hindoo \I/ (R) = 0. Its inverse : (0, oo) — > (0, oo) is also continuous and strictly 
decreasing and for all t > the equation ^(R) = cr(t) has a uniquely determined solution 
R > 0. Note that for rates results only sufficiently small t > are of interest. Setting 
R := (<j(||F(x) — F(a^)|| y )) we get some constant C > such that the variational 
inequality 

(£, xt - x) x ^ x < l - Depart) + C [d (*-\<t(\\F(x) - F(x^)\\ Y )))f 

holds for all x G •M. amaa {P)- Now the function defined by £(s) :— do o cr(s) when 
s > with extension £(0) := is an index function. Namely, ( is continuous on (0, oo), 
since d is continuous. Moreover, the limit liniR^oo d(R) = implies lim t _ + o = an d 
this ensures the continuity of ( in 0. On the other hand, by Lemma [3721 one has that d is 
strictly decreasing. Thus ( is strictly increasing, and hence an index function. 

Because of < ^ < 1 this proves the theorem, since <p := ( q \ namely ip(0) = and 
(p(t) = [d (\l/ _1 ((r(£)))] 9 when t > 0, is an index function, too. □ 



Remark 5.3 Easily one can see that the rate function [d o o a] q (t) in the variational 
inequality of Theorem 15.21 tends to zero as t — > slower than the associated rate function 
a(t) in the variational inequality of Theorem 15. 11 Namely, taking into account the one-to- 
one correspondence between large R > and small t via *&(R) = cr(t) and ^(R) = 
we have for the quotient function 

a(t) V(R) 1 

- — -> as R -> oo, resp. t -»• . 



As a consequence the situation of approximate source conditions occurring in Theorem [575] 
leads to lower convergence rates of Tikhonov regularization obtained from Theorem 14.31 
than the situation of canonical source conditions that appears in Theorem 15.11 
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Example 5.4 Concerning logarithmic rates as an example we are going to conclude the 
paper with a brief study that outlines the specific potential of variational inequalities (14.41) 
for extracting both solution smoothness of £ and nonlinearity conditions on F at in 
one index function (p which determines the convergence rate. Let in that example with 
some C > 

**) = f° ( ' = 0) x M 

^ W [C [log(l/0]-" (0 < * < e^" 1 ) V ; 

hold. From that assumption we derive for all /x > immediately by Theorem 14.31 a 
logarithmic convergence rate 

AK< 5) ^ f ) = O ([log(l/<y)]-") as ^0, (5.3) 

which is slower than every power rate (14. 2D for any « > 0. Now the function (15. 2D with 
slow decay to zero as t — > can be a consequence of two completely different causes 
characterized by the following two situations (I) and (II), respectively: 

(I) Let a = ip, i.e., a very weak logarithmic structural condition (13.91 ) is valid, and let 
hold the canonical source condition (13.21) . which expresses in our context the strong 
smoothness assumption on the solution. Then by Theorem 15.11 in connection with 
Theorem 14.31 we obtain the logarithmic convergence rate (15.31) . 

(II) Let a(t) = t, i.e., a structural condition (13.91) is satisfied, which is the strongest in 
our sense. However, the canonical source condition ( 13 .2D is strongly violated, which 
is expressed by a logarithmic decay 

d(R) = {logR)- u 

of the corresponding distance function for some v > and all sufficiently large 
R > R > 0. However, since we have for all such R and for e > a constant K > 
with 

m) = I > 

y 1 R{logR)T ~ R!+c ' 

this implies ^ _1 (t) > Kt^ l ^ 1+E ^ for some constant K > and sufficiently small 
t > 0. Hence, by Theorem 15.21 the function ip in (I4.4D attains the form (15.21) with 

fj, = vq*. 
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